"Express Mail" Mailing Label No, EL917901516US 



PATENT APPLICATION 
ATTORNEY DOCKET NO. SUN-P7077-SPL 



METHOD AND APPARATUS FOR SOLVING 
AN UNCONSTRAINED GLOBAL 
OPTIMIZATION PROBLEM 
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Related Application 

The subject matter of this application is related to the subject matter in a 
co-pending non-provisional application by the same inventors as the instant 
application entitled, "Applying Term Consistency to the Solution of 
Unconstrained Interval Global Optimization Problems," having serial number 
09/991,476, and filing date 16 November 2001 (Attorney Docket No. SUN- 
P7076-SPL). 

BACKGROUND 

Field of the Invention 

The present invention relates to performmg arithmetic operations on 
interval operands within a computer system. More specifically, the present 
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invention relates to a method and an apparatus for using a computer system to 
solve an unconstrained global optimization problem with interval arithmetic. 

Related Art 

Rapid advances in computing technology make it possible to perform 
trillions of computational operations each second. This tremendous 
computational speed makes it practical to perform computationally intensive tasks 
as diverse as predicting the weather and optimizing the design of an aircraft 
engine. Such computational tasks are typically performed using machine- 
representable floating-point numbers to approximate values of real numbers. (For 
example, see the Institute of Electrical and Electronics Engineers (IEEE) standard 
754 for binary floating-point numbers.) 

In spite of their limitations, floating-point numbers are generally used to 
perform most computational tasks. 

One limitation is that machine-representable floating-point numbers have a 
fixed-size word length, which limits their accuracy. Note that a floating-point 
number is typically encoded using a 32, 64 or 128-bit binary number, which 
means that there are only iP', 2^^ or 2^^^ possible symbols that can be used to 
specify a floating-point number. Hence, most real number values can only be 
approximated with a corresponding floating-point number. This creates 
estimation errors that can be magnified through even a few computations, thereby 
adversely affecting the accuracy of a computation. 

A related limitation is that floating-point numbers contain no information 
about their accuracy. Most measured data values include some amount of error 
that arises from the measurement process itself This error can often be quantified 
as an accuracy parameter, which can subsequently be used to determine the 
accuracy of a computation. However, floating-point numbers are not designed to 

2 

Attorney Docket No. SUN-P7077-SPL Inventors: Walster et a!, 

ARPHASUN MICROS YSTEMS\SUN-P7077-SPL\SUYN-P7077 APPLICATION3.DOC 



keep track of accuracy information, whether from input data measurement errors 

or machine rounding errors. Hence, it is not possible to determine the accuracy of 
a computation by merely examining the floating-point number that results from 
the computation. 

Interval arithmetic has been developed to solve the above-described 
problems, hiterval arithmetic represents numbers as intervals specified by a first 
(left) endpoint and a second (right) endpoint. For example, the interval [a, b], 
where a < 6, is a closed, bounded subset of the real numbers, R, which includes a 
and b as well as all real numbers between a and b. Arithmetic operations on 
interval operands (interval arithmetic) are defined so that interval results always 
contain the entire set of possible values. The result is a mathematical system for 
rigorously bounding numerical errors from all sources, including measurement 
data errors, machine rounding errors and their interactions. (Note that the first 
endpoint normally contains the "infimum", which is the largest number that is less 
than or equal to each of a given set of real numbers. Similarly, the second 
endpoint normally contains the "supremum", which is the smallest niunber that is 
greater than or equal to each of the given set of real numbers.) 

One commonly performed computational operation is to perform global 
optimization to find a global minimum of an objective function. This can be 
accomplished using any members of a set of criteria to delete boxes, or parts of 
boxes that cannot contain the global minimum/*. The set of criteria includes: 

(1) the/^6flr-criterion, wherein if fjbar is the smallest upper bound so 
far computed on/^, then any point x for whichy(x) >f_bar can be deleted. 
Similarly, any box X can be deleted if infifiX)) >f_bar\ 

(2) the monotonicity criterion, wherein if g(x) is the gradient of f 
evaluated at x, then any point x for which g(x) ^ 0 can be deleted. Similarly, any 
box X can be deleted if 0 f g(X); 
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(3) the convexity criterion, wherein if Huix) is the /-th diagonal 

element of the Hessian of/ then any point x for which //„(x) < 0 (for /=!,.. .,^) 

can be deleted. Similarly, any box X can be deleted if //«(X) < 0 (for 

and 

5 (4) the stationary point criterion, wherein points x are deleted using the 

interval Newton technique to solve the equation g(x) = 0. 

These criteria work best "in the small" when the objective function /is 
approximately quadratic. For large intervals containing multiple stationary points 
the above criteria might not succeed in deleting much of a given box. In this case 
1 0 the box is split into two or more sub-boxes that are processed independently. By 
this mechanism all the global minima of a nonlinear objective ftinction can be 
found. 

One problem is applying this procedure to large ^^-dimensional interval 
vectors (or boxes) that contain multiple local minima. In this case, the process of 
15 splitting in w-dimensions can lead to exponential grovrth in the number of boxes 
to process. 

It is well known that this problem (and even the problem of computing 
"sharp" bounds on the range of a function of /7-variables over an /7-dimensional 
box) is an "NP-hard" problem. In general, NP-hard problems require an 
20 exponentially increasing amount of work to solve as the number of independent 
variables, increases. 

Because NP-hardness is a worst-case property and because many practical 
engineering and scientific problems have relatively simple structure, one problem 
is to use this simple structure of real problems to improve the efficiency of 
25 interval global optimization algorithms. 

Hence, what is needed is a method and an apparatus for using the structure 
of a nonlinear objective function to improve the efficiency of interval global 
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optimization software. To this end, what is needed is a method and apparatus that 
efficiently deletes "large" boxes or parts of large boxes that the above criteria can 
only split. 



5 SUMMARY 

The present invention combines various methods to speed up the solution 
of systems of nonlinear equations. The combined methods use the structure of the 
objective function to efficiently delete parts of all of large sub-boxes that would 
otherwise have to be split. One embodiment of the present invention provides a 

1 0 system that solves an unconstrained interval global optimization problem 
specified by a fimction/ wherein /is a scalar fiinction of a vector 
X = (x;, X2, xs, ... Xn). The system operates by receiving a representation of the 
function/ and then performing an interval global optimization process to 
compute guaranteed bounds on a globally minimum value/*' of the function/x) 

15 and the location or locations x* of the global minimum. While performing the 
interval global optunization process, the system deletes all of part of a subbox X 
for which/x) >f_bar, wherein/^Z>ar is the smallest upper bound on/*" that has 
been so far computed. This is called the "fjyar test". The system applies term 
consistency to ihcfjbar test over the subbox X to increase that portion of the 

20 subbox X that can be proved to violate thQ fjbar test. 

For any fiinction of w-variables/x) there are different ways to analytically 
solve for a component Xj of the vector x. For example, one can write f(x) = 0 in an 
equivalent form/x) = g(xj) - h{x) ^ 0, where g{xj) is a term in / for which it is 
possible to solve g(x;) = y for any Xj using g ^(y). For each of these 

25 rearrangements, if a given interval box X is used as an argument of h, and if 
X/=Xjn X) where X) =g\hiX)\ then the new interval X/ for the/th 
component of X, is guaranteed to be at least as narrow as the original Xj . 
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This process is then iterated using different terms g of the function / After 

reducing any element Xj of the box X to X/, the reduced value can be used in X 
thereafter to speed up the reduction process using other component functions and 
terms thereof 

5 Hereafter, the notation g{xj) for a term of the function / (x) impHcitly 

represents any term of any component function. This eliminates the need for 
additional subscripts that do not add clarity to the exposition. 

One embodiment of the present invention provides a system that receives a 
representation of the function / and stores the representation in a memory. Next, 

10 the system performs an interval global optimization process to compute 
guaranteed bounds on a globally minimum value of the function^x) over a 
subbox X. This interval global optimization process applies term consistency to a 
set of relations associated with the function / over the subbox X, and excludes any 
portion of the subbox X that violates any of the relations. It also applies box 

15 consistency to the set of relations associated with the function / over the subbox 
X, and excludes any portion of the subbox X that violates any of the relations. 
The interval global optimization process also performs an interval Newton step on 
the subbox X to produce a resulting subbox Y, wherein the point of expansion of 
the interval Newton step is a point x within the subbox X, and wherein performing 

20 the interval Newton step involves evaluating the gradient g(x) of the function/x) 
using interval arithmetic. 

In a variation on this embodiment, while applying term consistency, the 
system symbolically manipulates an equation within the computer system to solve 
for a term g(Xj), thereby producing a modified equation g{Xj) = h{x), wherein the 

25 term g(xj) can be analytically inverted to produce an inverse function g'^Cy). Next, 
the system substitutes the subbox X into the modified equation to produce the 
equationg(X;) = /z(X), and solves for = g'XKX)). The system then intersects 
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X) with the interval Xj to produce a new interval X^j. The resulting subbox 
contains all solutions of the equation within the subbox X, and the size of the new 
subbox X^ is less than or equal to the size of the subbox X. 

In a variation on this embodiment, while performing the interval global 
optimization process, the system keeps track of a smallest upper hounAfJbar of 
the functionXx) and removes from consideration any subbox X for which 
/X) >f_bar. In this variation, applying term consistency to the fjbar relation 
involves applying term consistency to the inequality /(x) < f_bcir over the subbox 
X. In a further variation, applying box consistency to the f_bar relation involves 
applying box consistency to the inequality /x) < f bar over the subbox X. 

In a variation on this embodiment, while performing the interval global 
optimization process, the system determines the gradient g(x) of the function /(x) 
and removes from consideration any subbox for which any element of g(x) is 
bounded away from zero, thereby indicating that the subbox does not include a 
stationary point of fix). In this variation, applying term consistency to the set of 
relations involves applying term consistency to each component gi(x)=0 (/=/, .„,n) 
of g(x)=0 over the subbox X. In a further variation, applying box consistency to 
the set of relations involves applying box consistency to each component 
g,(x)=0 (z=7, ...,«) of g(x)=0 over the subbox X. 

In a variation on this embodiment, while performing the interval global 
optimization process, the system determines diagonal elements /f„(x) {i=l,...,n) of 
the Hessian of the fiinctionXx), and removes from consideration any subbox for 
which a diagonal element of the Hessian is always negative, which indicates that 
the fimction/is not convex and consequently does not contain a global minimum 
within the subbox. In this variation, applying term consistency to the set of 
relations involves applying term consistency to each inequality 
Hu(x) > 0 (/=1, over the subbox X. In a further variation, applying box 
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consistency to the set of relations involves applying box consistency to each 
inequality Hu{x) > 0 (/=1, ,..,n) over the subbox X. 

In a variation on this embodiment, performing the interval Newton step 
involves: computing the Jacobian J(x,X) of the gradient g evaluated as a function 

5 of a point x over the subbox X; computing an approximate inverse B of the center 
of J(x,X); and using the approximate inverse B to analytically determine the 
system Bg(x), wherein g(x) is the gradient of the functionXx), and wherein g(x) 
includes components glx) ...,^). In this variation, applying term consistency 
to the set of relations involves applying term consistency to each component 

10 (Bg(x)), = 0 (/=1, ...,n) for each variable (/=1, ...,«) over the subbox X. In a 
further variation, applying box consistency to the set of relations involves 
applying box consistency to each component (Bg (x)); = 0 (i=\, .,.,n) for each 
variable Xj (i=l,...,n) over the subbox X. 

In a variation on this embodiment, the system terminates attempts to 

1 5 further reduce the subbox X when the width of X is less than a first threshold 
value, and the magnitude of XX) is less than a second threshold value. 

BRIEF DESCRIPTION OF THE FIGURES 
FIG. 1 illustrates a computer system in accordance with an embodiment of 
20 the present invention. 

FIG. 2 illustrates the process of compiling and using code for interval 
computations in accordance with an embodiment of the present invention. 

FIG. 3 illustrates an arithmetic unit for interval computations in 
accordance with an embodiment of the present invention. 
25 FIG. 4 is a flow chart illustrating the process of performing an interval 

computation in accordance with an embodiment of the present invention. 
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FIG. 5 illustrates four different interval operations in accordance with an 
embodiment of the present invention. 

FIG. 6 is a flow chart illustrating the process of finding an interval solution 
to a nonlinear equation in accordance with an embodiment of the present 
invention. 

FIG. 7A is a first portion of a flow chart illustrating the process of solving 
an unconstrained interval global optimization problem in accordance with an 
embodiment of the present invention. 

FIG. 7B is a second portion of a flow chart illustrating the process of 
solving an unconstramed interval global optimization problem in accordance with 
an embodiment of the present invention. 

FIG. 7C is a third portion of a flow chart illustrating the process of solving 
an unconstrained interval global optimization problem in accordance with an 
embodiment of the present invention. 

FIG. 7D is a fourth portion of a flow chart illustrating the process of 
solving an unconstrained interval global optimization problem in accordance with 
an embodiment of the present invention. 

DETAILED DESCRIPTION 

The following description is presented to enable any person skilled in the 
art to make and use the invention, and is provided in the context of a particular 
application and its requirements. Various modifications to the disclosed 
embodiments will be readily apparent to those skilled in the art, and the general 
principles defined herein may be applied to other embodiments and applications 
without departing from the spirit and scope of the present invention. Thus, the 
present invention is not limited to the embodiments shown, but is to be accorded 
the widest scope consistent with the principles and features disclosed herein. 
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The data structures and code described in this detailed description are 
typically stored on a computer readable storage medium, which may be any device 
or medium that can store code and/or data for use by a computer system. This 
includes, but is not limited to, magnetic and optical storage devices such as disk 
drives, magnetic tape, CDs (compact discs) and DVDs (digital versatile discs or 
digital video discs), and computer instruction signals embodied in a transmission 
medium (with or without a carrier wave upon which the signals are modulated). 
For example, the transmission medium may include a communications network, 
such as the Intemet. 

Computer System 

FIG. 1 illustrates a computer system 100 in accordance with an 
embodiment of the present invention. As illustrated in FIG. 1 , computer system 
100 includes processor 102, which is coupled to a memory 1 12 and a to peripheral 
bus 1 10 through bridge 106. Bridge 106 can generally include any type of 
circuitry for coupling components of computer system 100 together. 

Processor 102 can include any type of processor, including, but not limited 
to, a microprocessor, a mainframe computer, a digital signal processor, a personal 
organizer, a device controller and a computational engine within an appliance. 
Processor 102 includes an arithmetic unit 104, which is capable of performing 
computational operations using floating-point numbers. 

Processor 102 communicates with storage device 108 through bridge 106 
and peripheral bus 110. Storage device 108 can include any type of non-volatile 
storage device that can be coupled to a computer system. This includes, but is not 
limited to, magnetic, optical, and magneto-optical storage devices, as well as 
storage devices based on flash memory and/or battery-backed up memory. 
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Processor 102 communicates with memory 1 12 through bridge 106. 
Memory 1 12 can include any type of memory that can store code and data for 
execution by processor 102. As illustrated in FIG, 1, memory 1 12 contains 
computational code for intervals 1 14. Computational code 1 14 contains 
5 instructions for the interval operations to be performed on individual operands, or 
interval values 115, which are also stored within memory 1 12. This 
computational code 1 14 and these interval values 1 15 are described in more detail 
below with reference to FIGs. 2-5. 

Note that although the present invention is described in the context of 
10 computer system 100 illustrated in FIG. 1, the present invention can generally 

operate on any type of computing device that can perform computations involving 
floating-point numbers. Hence, the present invention is not limited to the 
computer system 100 illustrated in FIG. 1. 

15 Compilmg and Using Interval Code 

FIG. 2 illustrates the process of compiling and using code for interval 
computations in accordance with an embodiment of the present invention. The 
Q system starts with source code 202, which specifies a number of computational 

operations involving intervals. Source code 202 passes through compiler 204, 
20 which converts source code 202 into executable code form 206 for interval 

computations. Processor 102 retrieves executable code 206 and uses it to control 
the operation of arithmetic unit 104. 

Processor 102 also retrieves interval values 115 fi-om memory 1 12 and 
passes these interval values 115 through arithmetic unit 104 to produce results 
25 212. Results 212 can also include interval values. 

Note that the term "compilation" as used in this specification is to be 
construed broadly to include pre-compilation and just-in-time compilation, as well 
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as use of an interpreter that interprets instructions at run-time. Hence, the term 

"compiler" as used in the specification and the claims refers to pre-compilers, 
just-in-time compilers and interpreters. 

5 Arithmetic Unit for Intervals 

FIG. 3 illustrates arithmetic unit 104 for interval computations in more 
detail accordance with an embodiment of the present invention. Details regarding 
the construction of such an arithmetic unit are well known in the art. For 
example, see U.S. Patent Nos. 5,687,106 and 6,044,454. Arithmetic unit 104 

10 receives intervals 302 and 312 as inputs and produces interval 322 as an output. 
In the embodiment illustrated in FIG. 3, interval 302 includes a first 
floating-point number 304 representing a first endpoint of interval 302, and a 
second floating-point number 306 representing a second endpoint of interval 302. 
Similarly, interval 312 includes a first floating-point number 314 representing a 

1 5 first endpoint of interval 3 1 2, and a second floating-point number 3 1 6 

representing a second endpoint of interval 312. Also, the resulting interval 322 
includes a first floating-point number 324 representing a first endpoint of interval 
322, and a second floating-point number 326 representing a second endpoint of 
interval 322. 

20 Note that arithmetic unit 104 includes circuitry for performing the interval 

operations that are outlined in FIG. 5. This circuitry enables the interval 
operations to be performed efficiently. 

However, note that the present invention can also be applied to computing 
devices that do not include special-purpose hardware for performing interval 

25 operations. In such computing devices, compiler 204 converts interval operations 
into a executable code that can be executed using standard computational 
hardware that is not specially designed for interval operations. 
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FIG. 4 is a flow chart illustrating the process of performing an interval 
computation in accordance with an embodiment of the present invention. The 
system starts by receiving a representation of an interval, such as first floating- 
point number 304 and second floating-point number 306 (step 402). Next, the 
5 system performs an arithmetic operation using the representation of the interval to 
produce a result (step 404). The possibilities for this arithmetic operation are 
described in more detail below with reference to FIG. 5. 

Interval Operations 

10 FIG. 5 illustrates four different interval operations in accordance with an 

embodiment of the present invention. These interval operations operate on the 
intervals X and 7. The interval X includes two endpoints, 

X denotes the lower bound of and 
3c denotes the upper bound of X. 

The intervals is a closed subset of the extended (including -~ and +oo) real 
15 numbers i?* (see line 1 of FIG. 5). Similarly the interval 7 also has two endpoints 
and is a closed subset of the extended real numbers i?* (see line 2 of FIG. 5). 

Note that an interval is a point or degenerate interval if X= [x, x]. Also 
note that the left endpoint of an interior interval is always less than or equal to the 
right endpoint. The set of extended real numbers, iZ* is the set of real numbers, R, 
20 extended with the two ideal points negative infinity and positive infinity: 

R^ = R^ {-oo} u {+oo}. 

In the equations that appear in FIG. 5, the up arrows and down arrows 
25 indicate the direction of rounding in the next and subsequent operations. Directed 
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rounding (up or down) is applied if the result of a floating-point operation is not 
machine-representable. 

The addition operation X+ 7 adds the left endpoint of Xto the left 
endpoint of Y and rounds down to the nearest floating-point number to produce a 
5 resulting left endpoint, and adds the right endpoint of Xto the right endpoint of Y 
and rounds up to the nearest floating-point number to produce a resulting right 
endpoint. 

Similarly, the subtraction operation X- 7 subtracts the right endpoint of Y 
from the left endpoint of X and rounds down to produce a resulting left endpoint, 

1 0 and subtracts the left endpoint of Y from the right endpoint of X and rounds up to 
produce a resulting right endpoint. 

The multiplication operation selects the minimum value of four different 
terms (rounded down) to produce the resulting left endpoint. These terms are: the 
left endpoint of X multiplied by the left endpoint of 7; the left endpoint of X 

1 5 multiplied by the right endpoint of 7; the right endpoint of X multiplied by the left 
endpoint of 7; and the right endpoint of X multiplied by the right endpoint of 7. 
This multiplication operation additionally selects the maximum of the same four 
terms (rounded up) to produce the resulting right endpoint. 

Similarly, the division operation selects the minimum of four different 

20 terms (rounded down) to produce the resulting left endpoint. These terms are: the 
left endpoint of X divided by the left endpoint of 7; the left endpoint of X divided 
by the right endpoint of 7; the right endpoint of X divided by the left endpoint of 
7; and the right endpoint of X divided by the right endpoint of 7 This division 
operation additionally selects the maximum of the same four terms (rounded up) 

25 to produce the resulting right endpoint. For the special case where the interval 7 
includes zero, X/7is an exterior interval that is nevertheless contained in the 
intervals?* 
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Note that the result of any of these interval operations is the empty interval 

if either of the intervals, X or F, are the empty interval. Also note, that in one 
embodiment of the present invention, extended interval operations never cause 
undefined outcomes, which are referred to as "exceptions" in the IEEE 754 
standard. 

Term Consistency 

FIG. 6 is a flov^ chart illustrating the process of solving a nonlinear 
equation through interval arithmetic and term consistency in accordance with an 
embodiment of the present invention. The system starts by receiving a 
representation of a nonlinear equation^x) = 0 (step 602), as well as a 
representation of an initial box with X with^ an element of X (step 604). Next, 
the system symbolically manipulates the equation^x) = 0 into a form g(X;)-^(x)= 
0, wherein the term g(Xj) can be analytically inverted to produce an inverse 
function g'^iy) (step 606). 

Next, the system substitutes the initial box X into h(X) to produce the 
equation g{X'j) = h(X) (step 608). The system then solves for = g\h(X)) 
(step 610). The resulting interval X) is then intersected with the initial interval Xj 
to produce a new interval X/ (step 612). 

At this point, ifX/ is empty, the system can terminate. Otherwise, the 
system can perform further processing. This further processing involves saving Xj 
by settingJ^^-^ =^ and also, settingJi;; - X/ (step 614). Next, ifw(X^^^) is 
sufficiently reduced at step 616, the system returns to step 606 for another 
iteration of term consistency on another term g of/x). Otherwise, the system 
terminates. 
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Examples of Applying Term Consistency 

For example, supposeX-^) - -x + 6. We can define g(x) = and 
hix)^x-6. LetX= [-10,10]. The procedural step is (Xf =X- (5= [-16,4]. Since 
(X'f must be non-negative, we replace this interval by [0,4], Solving forX', we 
5 obtainX' = + [0,2], In replacing the range of h(x) (i.e., [-16,4]) by non-negative 
values, we have excluded that part of the range h(x) that is not in the domain of 

Suppose that we reverse the roles of g and h and use the iterative step 
h{X')=g{X), Th^tisX'-6-X{ We obtain = [6,106]. Intersecting this 
1 0 result with the interval [-1 0, 1 0], of interest, we obtain [6, 1 0] . This interval 

excludes the set of values for which the range of g(X) is not in the intersection of 
the domain of h(X) withZ 

Combining these results, we conclude that any solution of 
fiX) = g(X) - h{X) = 0 that occurs mX= [-10,10] must be in both [-2,2] and 
15 [6,10]. Since these intervals are disj oint, there can be no solution in [- 1 0, 1 0] . 

In practice, if we already reduced the uiterval from [-10,10] to [-2,2] by 
solving for g, we use the narrower interval as input when solving for h. 

This example illustrates the fact that it can be advantageous to solve a 
given equation for more than one of its terms. The order in which terms are 
20 chosen affects the efficiency. Unfortunately, it is not known how best to choose 
the best order. 

Also note that there can be many choices for g(x). For example, suppose 
we use term consistency to narrow the interval bound X on a solution of 

J{x) ox^ + &x + c 0, We can let g(x) = bx and compute X' = -{aX^ + c)/i or 
25 we can let g{x) = ax^ and compute X' = ± [-{bX+c)/a] We can also separate x^ 
into x^ * x^ and solve for one of the factors X' = ± ['{bX-^c)l{aX^)f^. 
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In the multidimensional case, we may solve for a term involving more than 
one variable. We then have a two-stage process. For example, suppose we solve 
for the term l/{x+y) from the function/x,;;) = l/{x+y) - h{x,y) = 0. Let 
xeX^ [1,2] and;^ 6 7= [0.5,2]. Suppose we find that /^(J^r) = [0.5,1]. Then 
l/ix-^y) E [0.5,1] so x+y E [1,2]. Now we replace;/ by F= [0.5,2] and obtain the 
bound [-1,L5] onX Intersecting this interval with the given boundX= [1,2] on 
X, we obtain the new bound X' = [1 , 1 .5]. 

We can use X' to get a new bound on h; but this may require extensive 
computing if is a complicated function; so suppose we do not. Suppose that we 
do, however, use this bound on our intermediate resuh x + = [1,2]. Solving for j; 
as [1,2] -X\ we obtain the bound [-0.5,1]. Intersecting this interval with 7, we 
obtain the new bound 7' = [0.5, 1 ] on y. Thus, we improve the bounds on both x 
and>' by solving for a single term of f. 

The point of these examples is to show that term consistency can be used 
in many ways both alone and in combination with the interval Newton algorithm 
to improve the efficiency with which roots of a single nonlinear equation can be 
computed. The same is true for systems of nonlinear equations. 

Unconstrained Interval Global Optimization 

FIGs. 7A-7D collectively present a flow chart illustrating the process of 
solving an unconstrained global optimization problem in accordance with an 
embodiment of the present invention. This global optimization process computes 
guaranteed bounds on the globally minimum value/* of the objective function 
J{x) and guaranteed bounds on the point(s) x* where /(x) is a global minimum. 

Assume a given box (or boxes) in which the solution is sought is placed in 
a list i/ of boxes to be processed. Set wr=0. If a single box X^^^ is given, set 
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wi = w(X^^^). If more than one box is given, the system sets equal to the width 
of the largest box. 

If an upper bound on the minimum value of/x) is known, set fjbar equal 
to this value. Otherwise, the system sots f bar = +00. Note thsit fjbar is the 

5 smallest upper bound so far computed on/x), wherein x is a point in the initial 
box. Moreover, /^Z?ar can be used to eliminate from consideration any subbox 
whose lower bound is greater thmfjbar, because such a subbox cannot contain 
the globally minimxim value/*. Note that we can use the inequality /x) > >f_bar 
to delete parts of a subbox and can enhance this process for large boxes with term 

10 consistency. 

A box size tolerance €x and a function width tolerance €f are specified to 
facilitate termination when the box size and the function width become smaller 
than pre-specified values. This prevents unnecessary processing. 

We let A^^ denote the number of (consecutive) times that application of 
15 term consistency is applied to the set of relations H^i (x) ^0 (/=!,. and fails to 
reduce the current box. (Note that i/„(x) (/= 1, . . represent the diagonal 
elements of the Hessian of the objection function/x).) We begin with = 0. 

Thus, to initiahze our process, we specify Sx, Sf, wr, wj, N", and the initial 
box(es). In addition, we specify an upper bound on fJbar onf^ if one is known. 
20 We perform the following steps in the order given except as indicated by 

branching. For each step, the current box is denoted by X even though it may be 
altered in one or more steps. 

The system first evaluates / at (an approximation for) the center of each 
initial box in iy and uses the resuh to update f_bar if possible (step 701). Note 
25 that fJbar is updated whenever a new upper bound on / over a subbox X is found 
to be lower than the currentfjbar. 
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Next, for each initial box X in Lj, the system evaluates/X). The system 
then deletes any box for which the lower bound ofJ(X) is greater th3in fjyar 
(step 702). 

If Li is empty, the system goes to step 738. Otherwise, the system finds 
5 the box X in i/ for which the infimum of fiX) is smallest. The system selects this 
box as the one to be processed next. For later reference denote this box by X^^l 
The system also deletes X^^^ from iy (step 703). 

Next, the system evaluates / at (an approximation for) the center of X and 
uses the result to update /^&ar. The system skips this step if it has already been 
10 done for this box in step 701 (step 704). 

If term consistency has been applied n times in step 706 to the relation 
/x) <f_bar without having done step 710, the system goes to step 709 (step 705). 
(Note that the integer n is the number of variables upon which / depends,) 

Next, the system applies term consistency to the relation /(x) < fj)ar. If 
15 the result is empty, the system deletes X and goes to step 703 (step 706). 

If m;(X) < Ex and w(/(X))< f/r, the system puts X in list L2 and goes to step 
703 (step 707). 

If the box was sufficiently reduced in step 706, the system puts the result 
in Zy and goes to step 703 (step 708). We say that a box X is sufficiently reduced 
20 if any component of X is reduced by an amount that is at least a firaction (say 0.25) 
of the width of the widest component of X. 

If term consistency has been applied n times in step 710 to the components 
of the gradient without having applied a Newton step in step 723, the system goes 
to step 722 (step 709). 
25 Next, the system applies term consistency to gi{i)=0 (/=!,...,«) for each 

component ^^(x) of the gradient of/x). In so doing, the system bounds gi over the 
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resulting box. If a result is empty, the system deletes X and goes to step 703 
(step 710). 

Next, the system uses the center of the bounds obtained from working on 
gi in step 710 to do a line search to update /^6ar (step 71 1), A line search can be 
performed as follows. Suppose we evaluate the gradient g(x) of^x) at a point x. 
Note that / decreases (locally) in the negative gradient direction from x. A simple 
procedure for finding a point where/is small is to search along this half-line. Let 
X be the center of the current box. Define the half-line of points y(a)=x-ag(x) 
where a^O. We now use a standard procedure for finding an approximate 
minimum of the objective function / on this half-line. We first restrict our region 
of search by determining the value a' such that y(a')=x-a'g(x) is on the boundary 
of the current box and search between x and x '=y(a'). We use the following 
procedure. Each application of the procedure requires an evaluation of f. 
Procedure: lff(x')<fix), replace x by (x+x')/2. Otherwise, we replace x' by 
(x+x')/2. 

The system then uses the bounds on gj{X) {i=l,..,,n) obtained in step 710 
to apply a linear method, that uses a Taylor expansion to linearize /(x), to the 
relationXx) <fjbar (step 712). 

Next, the system repeats step 707 (step 713). 

Then, if the current box X is a sufficiently reduced version of the box X^^^ 
defined in step 703, the system puts X in list i/ and goes to 703 (step 714). 

If A^=¥, the system goes to step 720 (step 715). 

Next, the system applies term consistency to the relation Hu(X)>0 for 
/=!,...,«. If the result is empty, the system goes to step 703 (step 716). 

If the box is changed in step 716, the system sets N^=0 and goes to step 
718. Otherwise, if the box is unchanged in step 716, the system replaces by 
and goes to step 710 (step 717). 
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Next, the system repeats step 707 (step 718). 
It also repeats step 714 (step 719). 

The system then repeats steps 706-719 using box consistency in place of 
term consistency. However, the system skips steps 711-714. In step 710, the 
system omits the process of obtaining bounds on g^(X) (step 720). (Box 
consistency is described in "Global Optimization Using Interval Analysis" by 
Eldon R. Hansen, Marcel Dekker, Inc., 1992.) 

If w{X) > (wi+wr)/2, the system goes to step 732 (step 721). Note that, wj 
denotes the width of the smallest box for which we have found = B J(x,X) to 
be irregular. If is regular for a given box, wr denotes the width of the largest 
box for which we have found to be regular. Initially, we set wj =w(X^^^) and 

Wr =0. 

Next, the system computes the Jacobian J(x,X) of the gradient g. It also 
computes an approximate inverse B of the approximate center of J(x,X) and the 
matrix M(x^)=BJ(x,X) (step 722). 

The system finds the hull of the solution set of the linear system 
determined in step 722 and in the process determines if M(x,X) is regular. If 
M(x,X) is irregular, the system applies one pass of the Gauss-Seidel method to the 
linear system. The system also updates wj or wr. If the result of the Newton step 
is empty, the system goes to step 703. If the interval Newton step proves the 
existence of a solution in X, the system records this information (step 723). 

Next, the system uses the gradient value g(x) and the matrix B computed 
in step 722 to compute the point y=x-Bg(x), The system uses the value of f(y) to 
updatQ f_bar (step 724). 

The system then uses a quadratic method to solvey(x) <f_bar (step 725). 

Next, the system repeats step 707 (step 726). 

It also repeats step 714 (step 727). 
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Using the matrix B computed in step 722, the system analytically 
determines the system Bg(x), and then applies term consistency to solve the /-th 
component of Bg(x) for the z'-th variable Xi for i=^l,,..,n. If this procedure proves 
the existence of a solution in X, the system records this information (step 728). 
5 Next, the system repeats step 707 (step 729). 

It also repeats step 714 (step 730). 

Next, the system applies box consistency to solve the /-th component of 
Bg(x) as determined in step 725 for the /-th variable for /=1,...,«. (step 731). 
Next, the system repeats step 707 (step 732). 
10 If the width of the box was reduced by a factor of at least eight in the 

Newton step in step 723, the system goes to step 722 (step 733). 
□ Next, the system repeats step 714 (step 734). 

J The system then merges any overlapping gaps in components of X if any 

^.y were generated using term consistency, box consistency, and/or the Newton 

O 15 method. If there are any gaps (after merging) that are suitable for use in splitting 
r" X, the system goes to step 733. Suitability for splitting can be determined by 

ensuring that the splitting results in a sufficient size reduction in a component of 
O X. If there are no such suitable gaps, the system goes to step 737 (step 735). 

p Next, the system selects three (or fewer if fewer exist) gaps and splits X 

" ^ 20 into subboxes. The system then goes to step 703 (step 736). 

The system then splits the three (or n if n<3) components of X for which 
Dj {i=U,..,n) is largest The system splits each such component by using a 
technique that gives preference to components over which the objective function 
varies the most. For example, we can approximate the change in / resulting from 
25 the /-th variable changing over X, using the approximation Z), = w[g,(X)]v^'(Xi). 
The system then goes to step 703 (step 737). 
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Next, the system deletes any box X from list L2 for which the lower bound 
of XX) > f_bar. The system denotes the remaining boxes by X^^V^-.X^^^ where s is 
the number of boxes remaining. The system determines the lower bound for the 
global minimum/^ as the minimum over i=l<i<n of the lower bound ofy(X^*^) 
5 (step 738). 

Finally, the system terminates (step 739). 

Discussion of the Process 

The above-described process begins with procedures that involve the least 
10 amount of computing. We use term consistency first because it does not require 

computation of derivatives (and because it is effective in reducing large boxes). 
For efficiency, the best box X to process is the one for which the lower 

bound of XX) is smallest. This tends to quickly reduce the upper bound/^6ar. 

Because it is important to reduce/^6ar as quickly as possible, the process retums 
15 to step 703 frequently to determine the box to process next. 

We stop using the relations //„(x)>0 (z=1,...,m) when there is evidence that 

the remaining boxes are in a region where /is convex. See step 715. Using a 

similar philosophy, we begin using the Newton method more often when there is 

evidence that it will be effective. See step 721. 
20 Note that the Jacobian J(x,X) is the Hessian of the objective function / 

Therefore, knowing J(x,X) provides the means for determining the second order 

Taylor expansion of f. The gradient g(x) needed for this expansion is also 

computed when applying the Newton method in step 723. 

Therefore, we have the data required to use the quadratic method to solve 
25 the relationXx) < fj)ar. See step 725. If these data are not already available, it 

may not be worth generating them simply to solve^x) <fj)ar using the quadratic 

method. 
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Step 717 is done after term consistency is applied to the relation Hii{X)>0, 
When the box is unchanged, the count is increased by 1 . Step 717 is repeated 
after box consistency is applied to what may be the same box. See step 720. 
Therefore, the count can increase faster than one might otherwise expect. 

The algorithm avoids too many applications of term consistency before 
changing procedures. Step 705 can force application of term consistency to the 
gradient instead of to the inequality / < f_bar. Step 709 can force change from 
applying term consistency to the gradient to applying a Newton step. This takes 
precedence over our desire not to apply a Newton step when it may not be 
efficient. 

We need occasional checks of efficiency of the Newton method because 
the current box may have become so small that the Newton method exhibits 
quadratic convergence and thus is more efficient than term consistency. When we 
force a change from term consistency, we also force a change from box 
consistency. This occurs because we do not apply the latter without having 
previously applied the former. 

Step 733 causes the Newton step to be repeated. If the Newton method is 
exhibiting quadratic convergence, we want to take advantage of its ability to 
progress rapidly. 

The foregoing descriptions of embodiments of the present invention have 
been presented only for purposes of illustration and description. They are not 
intended to be exhaustive or to limit the present invention to the forms disclosed. 
Accordingly, many modifications and variations will be apparent to practitioners 
skilled in the art. For example, although the present invention describes the use of 
derivatives in certain situations, it is often possible to use slopes instead of 
derivatives. 
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Additionally, the above disclosure is not intended to limit the present 
invention. The scope of the present invention is defined by the appended claims. 
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